Generative AI
May 14, 2024
Generate Text Responses from Visual and Text Inputs with Google's New PaliGemma Model
With free NVIDIA cloud credits, you can start testing the model at scale on the API Catalog.
1 MIN READ
May 14, 2024
NVIDIA TensorRT 10.0 Upgrades Usability, Performance, and AI Model Support
NVIDIA today announced the latest release of NVIDIA TensorRT, an ecosystem of APIs for high-performance deep learning inference. TensorRT includes inference...
7 MIN READ
May 13, 2024
Customizing Neural Machine Translation Models with NVIDIA NeMo, Part 2
In the first post, we walked through the prerequisites for a neural machine translation example from English to Chinese, running the pretrained model with NeMo,...
11 MIN READ
May 13, 2024
Customizing Neural Machine Translation Models with NVIDIA NeMo, Part 1
Neural machine translation (NMT) is an automatic task of translating a sequence of words from one language to another. In recent years, the development of...
8 MIN READ
May 13, 2024
Regional LLMs SEA-LION and SeaLLM Serve Languages and Cultures of Southeast Asia
At the recent World Governments Summit in Dubai, NVIDIA CEO Jensen Huang emphasized the importance of sovereign AI, which refers to a nation’s capability to...
3 MIN READ
May 12, 2024
Enabling Quantum Computing with AI
Building a useful quantum computer in practice is incredibly challenging. Significant improvements are needed in the scale, fidelity, speed, reliability, and...
6 MIN READ
May 12, 2024
Advanced AI and Retrieval-Augmented Generation for Code Development in High-Performance Computing
In the rapidly evolving field of software development, AI tools such as chatbots and GitHub Copilot have significantly transformed how developers write and...
8 MIN READ
May 10, 2024
Dynamic Control Flow in CUDA Graphs with Conditional Nodes
CUDA Graphs can provide a significant performance increase, as the driver is able to optimize execution using the complete description of tasks and...
7 MIN READ
May 08, 2024
Amdocs Accelerates Generative AI Performance and Lowers Costs with NVIDIA NIM
Telecommunications companies (telcos) are leveraging generative AI to increase employee productivity by automating processes, improving customer experiences,...
10 MIN READ
May 08, 2024
Accelerate Generative AI Inference Performance with NVIDIA TensorRT Model Optimizer, Now Publicly Available
In the fast-evolving landscape of generative AI, the demand for accelerated inference speed remains a pressing concern. With the exponential growth in model...
9 MIN READ
May 08, 2024
Tips for Building a RAG Pipeline with NVIDIA AI LangChain AI Endpoints
Retrieval-augmented generation (RAG) is a technique that combines information retrieval with a set of carefully designed system prompts to provide more...
13 MIN READ
May 03, 2024
Explainer: What Is a Vector Database?
A vector database is an organized collection of vector embeddings that can be created, read, updated, and deleted at any point in time. Vector embeddings...
1 MIN READ
May 03, 2024
Visual Language Intelligence and Edge AI 2.0
VILA is a family of high-performance vision language models developed by NVIDIA Research and MIT. The largest model comes with ~40B parameters and the smallest...
8 MIN READ
May 03, 2024
Visual Language Models on NVIDIA Hardware with VILA
Visual language models have evolved significantly recently. However, the existing technology typically only supports one single image. They cannot reason among...
11 MIN READ
May 01, 2024
Spotlight: Continental and SoftServe Deliver Generative AI-Powered Virtual Factory Solutions with OpenUSD
With automotive consumers increasingly seeking more seamless, connected driving experiences, the industry has increased its focus on connectivity, advanced...
5 MIN READ
Apr 30, 2024
Leverage Mixture of Experts-Based DBRX for Superior LLM Performance on Diverse Tasks
This week’s model release features DBRX, a state-of-the-art large language model (LLM) developed by Databricks. With demonstrated strength in programming and...
3 MIN READ